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ORACLE CONFIDENTIAL 

A GRAPHICAL CONDITION BUILDER FOR FACILITATING DATABASE 

QUERIES 

RELATED APPLICATIONS 

[0001] This application claims priority benefit of U.S. Provisional Patent 

Application, Serial Number 60/413,914, filed September 25, 2002, which is hereby 
incorporated by reference in its entirety. 

FIELD OF THE INVENTION 

[0002] The invention relates to User Interfaces (UI) in general, and more 

specifically to a UI for writing search queries. 

BACKGROUND 

[0003] Query construction can be thought of as a two-stage process having an 

assembly stage followed by a refinement stage, as discussed by Willis et al. "Users' 
Models of the Information Space: The Case for Two Search Models," Proceedings of the 
ACM Special Interest Group on Information Retrieval (SIGIR) '95, (Seattle WA, 1995), 
ACM Press, pp. 205-210 and Plaisant et al., "Interface and Data Architecture for Query 
Preview in Networked Information Systems," ACM Transactions on Information 
Systems, 17, 3 (July 1999), pp. 320-341. The assembly stage gathers together relevant 
information, and the refinement stage iteratively narrows and nests the user's focus until 
a desired set is achieved. 

[0004] However, use of Boolean operators is difficult for novice users because, 

for example, common language usages of these terms can be the opposite of their usage 

in Boolean expressions, as discussed by Pane et al., "Tabular and Textual Methods for 

Selecting Objects from a Group," Proceedings of VL 2000: IEEE Int. Symposium on 

Visual Languages (Seattle, WA, September 2000), IEEE Press, pp. 157-164 and Young 
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et al. As an example of the confusion with common language, assume that a user wants 
to extract all records associated with two cities, Boston and New York. Under these 
circumstances, if the user relies on common language parlance, the user may logically 
request all records from "Boston AND New York," which produces a Boolean 
intersection of sets of records associated with these cities, and is likely to result in no 
records being returned, depending on the nature of the data in the database being 
searched. In contrast, the correct request would be a union operator using a Boolean 
search query of "Boston OR New York." However, the naive user may incorrectly 
believe that the "OR" means that only one of these cities would be considered. 
[0005] Another difficulty a user may have is that in text queries, parentheses are 

used to establish precedence among the operators, which may be poorly understood by 
novice users, as discussed by Michard, "A New Database Query Language for Non- 
Professional Users: Design Principles and Ergonomic Evaluation," Behavioral and 
Information Technology, Vol. 13 (1982), pp. 279-288, and Young et al., cited above. 
Nested parentheses are a particular source of difficulty for many users, as also discussed 
by Michard. In particular, when there are many levels of nesting the user may get 
confused as to which two parentheses form the pair that delineate the level the user is 
interested in. 

[0006] As discussed by Jones, "Dynamic Query Result Previews for a Digital 

Library," Proceedings of the 3 rd ACM Conference on Digial Libraries (Pittsburgh, PA, 
May 1998), ACM Press, pp. 291-292, and "Graphical Query Specification and Dynamic 
Result Previews for a Digital Library," Proceedings of the 1 1 th Annual ACM Symposium 
on User Interface Software and Technology (San Francisco, CA, November 1998), ACM 
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Press, pp. 143-151, and Plaisant et al., "Interface and Data Architecture for Query 
Preview in Networked Information Systems," ACM Transactions on Information 
Systems, Vol. 17, No. 3 (July 1999), pp. 320-341, a graphical representation of the query 
can result in substantial benefits during the refinement phase of query building. Potential 
advantages include fewer zero-hit queries, reduced network activity due to the prevention 
of retrieval of undesired records, improved query comprehension, and better support for 
browsing within an unfamiliar database. 

[0007] Query preview is available in prior prototypes that support dynamic 

querying, using direct manipulation to construct queries, as disclosed by Ahlberg et al., 
"Dynamic Queries for Information Exploration: An Implementation and Evaluation," 
Proceedings of ACM CHI '92, (1992), ACM Press, pp. 619-626, Malan et al., "Visual 
Query Tools for Uncertain Spatio-Temporal Data," Proceedings of the 9 th ACM 
International Conference on Multimedia (Ottawa, Canada, September 2001), ACM Press, 
pp. 522-524, and Plaisant et al., "Dynamaps: Dynamic Queries on a Health Statistics 
Atlas," Proceedings of ACM CHI '94 (Boston, MA, April 1994), ACM Press, pp. 439- 
440. However, query preview may add a significant amount of processing overhead. 
[0008] Query formulations that include tabular layouts of cards are disclosed by 

Pane et al., "Improving User Performance on Boolean Queries" Proceedings of CHI 2000 
Extended Abstracts (The Hague, Netherlands, April 2000). ACM Press, pp. 269-270, and 
"Tabular and Textual Methods for Selecting Objects from a Group," Proceedings of VL 
2000: IEEE Int. Symposium on Visual Languages (Seattle, WA, September 2000), IEEE 
Press, pp. 157-164. Similarly, boxes that may overlap in horizontal or vertical 
dimensions are used by Anick et al, "A Direct Manipulation Interface for Boolean 
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Information Retrieval via Natural Language Query," Proceedings of ACM Special 
Interest Group on Information Retrieval (SIGIR) '90, ACM Press, pp. 135-150. 
[0009] Young et al., "A Graphical Filter/Flow Representation of Boolean 

Queries: A Prototype Implementation and Evaluation," Journal of the American Society 
of Information Science, Vol. 44, No. 6 (1993), pp. 327-339, introduced a Filter/Flow 
model for query building to overcome issues related to the limited Boolean knowledge of 
end users. Young et al.'s prototype relied upon a graphical representation of parallel 
branching and constrictions in data flowing to represent OR's and AND's in a Boolean 
query. 

[0010] FIG. 1 shows prior filter flow interface 100 according to Young et al. 

Filter flow interface 100 has filters 102, 104, 106, 108, and 110, and has dataflow lines 
122, 124, 126, 128, and 130. The width of each of dataflow lines 122, 124, 126, 128, and 
130 represents the amount of data retrieved after application of those of filters 102, 104, 
106, 108, and 1 10 that precede that dataflow line. Filters 102, 104, 106, 108, and 1 10 
filter the collection of data by limiting the total amount of data in the results as depicted 
by the dataflow lines 122, 124, 126, 128, and 130. As the data is filtered, the dataflow 
lines get narrower. An analogy can be made to dams on a river constricting the flow of 
the river and filters 102, 104, 106, 108, and 1 10 (analogous to the dams) constricting 
dataflow lines 122, 124, 126, 128, and 130 (analogous to the rivers) by limiting the 
amount of data remaining in the search results (analogous to the amount of water 
remaining in the river). 

[0011] In FIG. 1, filters located on dataflow lines arranged parallel to one another 

are logically ORed with one another forming a union of the two sets of data, while filters 
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located on dataflow lines that are serially connected are logically ANDed with one 
another forming the intersection of the two sets of data. In the example depicted in FIG. 
1 , the search query represented will find any document having a location of Georgia 
(filter 102, filtering dataflow line 122 to produce dataflow line 124) including either 
clerks (filter 104, filtering dataflow line 124 to produce dataflow line 126) with an 
income of $40,000 or more (filter 106, filtering dataflow line 126 to contribute to 
dataflow line 128), or all accountants and engineers (filter 108 also filtering dataflow line 
124, but producing dataflow line 130) having Elizabeth as their manager (filter 110 
filtering dataflow line 130 to also contribute to dataflow line 128). Dataflow line 128 that 
include the output of filter 1 10 ORed with the output of filter 106 as indicated by filters 
106 and 1 10 being located on parallel paths including parallel dataflow lines 126 and 
130. 

[0012] Using Young et al.'s Filter/Flow model, users comprehended and 

constructed queries more accurately, compared to users of all text-based queries. Users 
also strongly preferred the graphical interface over the text query interface. However, the 
Filter/Flow model by Young et al. is limited in its flexibility and was not connected to a 
database. 
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SUMMARY OF THE INVENTION 

[0012] The present condition builder provides several features that can each be used 
independently of one another or with any combination of the other features. Although 
many of the features of the present condition builder are motivated by the problems 
explained above, any individual feature may not address any of the problems discussed 
above or may only address one of the problems discussed above. Some of the problems 
discussed above may not be fully addressed by any of the features of the present 
condition builder. 

[0013] A graphical condition builder is provided that presents a graphical 
representation of a search query. In an embodiment, a graphical and a numerical 
representation of expected sizes of results sets is provided simultaneously within the 
preview representation of a query's likely search results. In an embodiment, the 
condition builder provides a graphical representation of groups of filter objects, which 
can be used to affect the order of operations of the filter objects in a manner similar 
(although not necessarily identical) to parentheses within a Structured Query Language 
(SQL) clause. In an embodiment, at least some logical operations are automatically 
determined by the placement of an icon representing an object filter within a graphical 
representation of a query. In an embodiment, a graphical representation of a NOT 
operation is provided, where the NOT operation inverts one or more results sets. In an 
embodiment, a Boolean query is divided into steps and each step may contain multiple 
filter objects. Optionally, the steps can be individually enabled, disabled, or otherwise 
controlled. In an embodiment, the preview of the size of a results set is represented using 
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graphical preview portions that narrow continuously. In an embodiment, the graphical 
preview portions have a shape that is suggestive of a funnel. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0014] The present invention is illustrated by way of example, and not by way of 
limitation, in the figures of the accompanying drawings and in which like reference 
numerals refer to similar elements and in which: 

[0015] FIG. 1 shows a prior art search query interface, which was discussed above. 
[0016] FIG. 2 shows an embodiment of a search query interface according to the 
invention. 

[0017] FIG. 3 shows a search query interface similar to that of FIG. 2 having a group 
box that is opened. 

[0018] FIG. 4 shows the group box of FIG. 3 with its group of filter objects selectable 
for editing. 

[0019] FIG. 5 shows an embodiment of the invention in which the search query is 
divided into query steps. 

[0020] FIG. 6 shows an embodiment of a user interface for entering the search query. 
[0021] FIG. 7 shows a second embodiment of the user interface. 
[0022] FIG. 8 shows an alternative embodiment of the search query interface. 
[0023] FIG. 9 is a block diagram that illustrates a computer system 900 upon which 
an embodiment of the invention may be implemented. 
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DETAILED DESCRIPTION OF SOME EXAMPLES OF THE INVENTION 
[0024] In general in this specification, the contents of each figure will be discussed 
by first briefly describing each labeled item, primarily to establish a list of each labeled 
feature of the figure, and then a more detailed description of the drawing will follow. 
Each of the figures is generally discussed in the order in which it is numbered. Although 
headings are provided, information related to a particular heading not found in the section 
having that heading may also be found elsewhere in the specification. 

OVERVIEW OF GRAPHICAL CONDITION BUILDER 

[0025] FIG. 2 shows a graphical condition builder 200 having a canvas 201 and an 
item container 202, which includes source designators 204a- f. Source designators 204a-f 
identify the set of data against which the query is to be executed. In the present example, 
the query is to be executed against data from a database. Thus, source designators 204a-f 
represent a specific set of information from a specific set of tables within the database. 
[0026] In FIG. 2, source designators 204b and 204d respectively represent Sale 
Details and Cost Details, which correspond to sets of data items from the database. On 
the other hand, source designators 204a, 204c, 204e and 204f respectively represent data 
about Channels, Countries, Products, and Years, which correspond to dimensions of the 
"Sales Details" and "Cost Details" data items. Each individual data item in the "Sales 
Details" data set and the "Cost Details" data set will be associated with a particular value 
for each dimension. Thus, as will be illustrated hereafter, dimensional source designators 
204a, 204c, 204e, and 204f can be used for filtering, ordering or arranging the "Sales 
Details" and "Cost Details" data sets. 
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[0027] Graphical condition builder 200 also has filter icons 206a-g, which are joined 
to one another by dataflow lines 220a-g. Each of filter icon 206 represents a condition 
that affects which data is satisfied by the query. The data flow lines that flow down into 
a particular filter icon 206 represents the set of data items that would be returned by the 
elements of the query that precede the particular filter icon 206. Conversely, the data 
flow lines that flow down from the particular filter icon 206 represents the set of data 
items that would be returned by combining the condition that is represented by the 
particular filter icon 206 with those previous elements. 

[0028] For example, the data flow line 220a that flows down into filter icon 206b 
represents the set of data items that would be returned by the condition associated with 
filter icon 206a. Conversely, the data flow line 220b that flows down from filter icon 
206b represents the set of data that would be returned by combining the condition that is 
represented by the filter icon 206b with the condition represented by filter icon 206a. 
[0029] Under each of filter icons 206a-g is one of descriptive titles 208a-g. Each of 
descriptive titles 208a-g indicates the condition that is associated with the filter icon 
206a-g to which it corresponds. For example, descriptive title 208b is "Top 10 
Customers", which indicates that the corresponding filter icon 206b represents a 
condition that limits the query to data items that are associated with the top 10 customers. 
[0030] In the vicinity of each icon is one of graphical preview portions 230a- f. In the 
illustrated embodiment, each graphical preview portion is associated with a stage of the 
query, and has two lateral members. The relative distance between the tops of the lateral 
members reflects the relative quantity of data items that satisfy the query elements that 
are prior to (e.g. above) the stage. The relative distance between the bottoms of the 
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lateral members reflects the relative quantity of data items that satisfy the query after the 
stage. For example, the distance between the tops of the lateral members of portion 230b 
represents the relative amount of data that would flow through data flow line 220a. On 
the other hand, the distance between the bottoms of the lateral members of portion 230b 
represents the relative amount of data that would flow through data flow line 220b. 
[0031] Also included in the graphical condition builder are OR indicators 232a and b, 
logical operator buttons 240, 242 and 244, and group button 246. In alternative 
embodiments, condition builder 200 may not have all of the features listed above or 
otherwise associated with FIG. 2, and/or may have other features in addition to or instead 
of the features listed above or otherwise associated with FIG. 2. 

[0032] Graphical condition builder 200 may be used in any context in which Boolean 
expressions are constructed or presented, such as in conjunction with a variety of search 
engines. There is virtually no limit to the type of search engine with which the graphical 
condition builder 200 may be used. For example, graphical condition builder 200 may be 
used in conjunction with search engines that run searches against tables in a database, 
web pages, and/or documents. Accordingly, the term "item" will be used for the basic 
unit of information for which a searched is being performed. Also, although commonly 
one speaks of the number of "hits" found by a search, this application refers to the 
number of items found in a search, which is to be understood to mean the same thing. 
[0033] In graphical condition builder 200, the search query is constructed graphically 
on canvas 201. As mentioned above, source indicators 204a- f generally represent the 
categories of information from which the query is to retrieve data from a source. For 
example, the source may be a database, and source indicators 202a-f may represent 
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specific tables, or specific columns from tables, within the database, from which data is 
to be extracted. 

[0034] The collection of source objects (e.g., source designators 204a-f) within item 
container 202 effectively represents the initial scope of a query. Subsequent condition 
filters may apply to all objects (e.g., source designators 204a-f) in item container 202, as 
in a relational database, or may only apply to one dimensional objects, as in a 
multidimensional (OLAP) database. The initial scope of analysis can be broadened or 
narrowed by selecting and highlighting target objects (e.g., source designators 204a-f) 
within the item container 202. 

[0035] The filter objects are represented by filter icons 206a-g. The filter objects 
represent filters that restrict which information is to be extracted from the source. The 
filter objects associated with filter icons 206a-g may represent restrictions on one or more 
of the source designators 204a-f in item container 202. For example, source designator 
204(e) is products, and filter icon 206(d) filters all but the top 10 products (where the 
products are ranked based on the number of orders). 

[0036] Similar to filter flow interface 100 (FIG. 1) filters associated with those of 
filter icons 206a-g that are located along those of dataflow lines 220a-d that are 
connected serially are being ANDed with one another. In the embodiment depicted in 
FIG. 2, the flow is from the top of canvas 201 towards the bottom of canvas 201, because 
it conforms with an intuitive notion of water flowing from high places to low places. 
However, in alternate embodiments the flow may run from bottom to top, left to right, 
right to left, diagonally, or any other direction. The conditions associated with those of 
filter icons 206a-g that are located on those of dataflow lines 220a-d that are parallel to 
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The OR operation is additionally indicated by 



"AND" AND "OR" INDICATORS 

[0037] A user interface mechanism is provided that allows filter icons to be created 
by selecting a source designator 204 from item container 202 and dragging and dropping 
the icon onto a desired position on the canvas 201 . The user can specify and/or modify 
the condition for the filter either before or after dragging it to this location. For example, 
filter icon 206g can be created by dragging the source designator 204f for "Years" from 
container 202, and then modifying the condition to specify the condition "Last 3 Years". 
[0038] According to one embodiment, the position at which a filter is placed 
determines how the corresponding condition is combined with the already-specified 
conditions of the query. For example, in one embodiment, if a filter icon is placed 
horizontally adjacent to one or more existing icons, then the condition associated with the 
filter icon is ORed with the conditions of the horizontally adjacent icons. On the other 
hand, if a filter icon is placed above or below existing filter icons, then the condition 
associated with the filter icons is ANDed with the conditions associated with the other 
icons. Temporary ghost image lines are observable during the process of dragging a 
condition, to allow the user to understand whether the condition will be ANDed or ORed 
if dropped at a specific location. 

[0039] Once placed, a user interface mechanism is provided that allows filter icons 
206a-g to be moved from their present location by dragging and dropping them to change 
the type of operation being performed between the filters associated with icons 204a-f 
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and thereby change the query. In an embodiment, as filter icons 206a-g are being 
dragged on canvas 201, a ghost image appears displaying either an AND indicator or an 
OR indicator, depending upon whether the icons are vertically aligned (one above the 
other) or horizontally aligned (side by side). In an embodiment, the AND indicators only 
appear while icons are in the process of being moved, but not after they have been 
dropped into a position. 

[0040] For example, if filter icons 206e and 206f are moved so that they are vertically 
aligned, then the dataflow lines 220c and 220d automatically rearrange themselves to 
show filter icons 206e and 206f being located along dataflow lines connected serially, 
and OR indicator 232b automatically disappears. Similarly, were filter icons 206a and 
206b moved so that they are located to the sides of one another, then dataflow lines 220a 
and 220b are automatically rearranged to show filter icons 206a and 206b being located 
on parallel dataflow lines, and an OR indicator automatically appears, and the search 
query is altered so that the conditions associated with filter icons 206a and 206b are 
ORed. The ability to dynamically preview results of a change in the search query by 
moving search query components on canvas 201 facilitates the user iteratively altering 
the search until the number of items or hits found by the search query is within a desired 
range. 

[0041] In alternative embodiments, the AND indicator may also appear after the 
icons are dropped into position, the OR indicator may not appear after the icon is 
dropped, the AND and/or the OR indicator may not appear while the icons are being 
moved, and/or there may not be AND and/or OR indicators. 



50277-2252 

(ODD No. 2002-175-01) 



14 



ORACLE CONFIDENTIAL 

[0042] In the example depicted in FIG. 2, the logical operations of AND and OR can 
also be added using logical operator buttons 240 and 242. For example, graphical query 
operator 200 may support construction or altering a search query by (1) selecting a 
location within graphical condition builder 200, (2) selecting the icons representing filters 
already in the query (on canvas 201) or by selecting two of source designators 204a-f 
between which the operators will be applied, and then (3) selecting the appropriate one of 
logical operator buttons 240 and 242. In response two icons will be automatically placed 
on canvas 201 along the dataflow line at the location selected either in parallel or in series 
according to the logical operator selected. Alternatively, the new icons may be placed on 
canvas 201 at a default location, such as at the end of the search query diagram. 

NOT OPERATOR 

[0043] The logical NOT operation is also supported by graphical condition builder 
200. The logical NOT inverts the results so that the items that would have been included 
without the NOT operator are excluded from the search results, and the items that would 
have been excluded without the NOT operator are included within the search results. In 
an embodiment, the depiction of an X on a dataflow line may indicate the performance of 
a logical NOT operation. In alternative embodiments, the NOT operation may be 
indicated by placing the indicia "NOT" in addition to or instead of the X on the dataflow 
line. The NOT operator may be added to a search using the logical operator button 244. 
For example, a NOT operator may be added by selecting operator button 244 and then 
selecting the location within the graphic illustration of the search query to insert the not 
operator. According to one embodiment, a user can drag an icon representing the NOT 
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operator over the canvas, and drop the icon at the desired location within the graphical 
representation of the query. 

GRAPHICAL PREVIEW PORTIONS 

[0044] Graphical preview portions 230a-f are visually distinct graphical preview 
indications that show how the collection of items found is narrowed (or widened, as may 
be the case when a NOT operator is placed on an existing portion of a query) by the 
application of each filter object. As mentioned above, the widths of graphical preview 
portions 230a-g are proportional to the number of documents expected to be in the results 
set at the corresponding point in the query flow, and graphical preview portions 230a-g 
may have shapes suggestive of funnels or cones, for example. The top width of each of 
graphical preview portions 230a- f represents the size of the results set being filtered by 
the associated filter object prior to applying the filter associated with the filter object. 
The bottom width of each of graphical preview portions 230a-f represents the expected 
size of the results set of the associated filter object after being filtered. 
[0045] In contrast to the widths of dataflow lines 122-130 (FIG. 1), the widths of 
graphical preview portions 230a-f typically narrow continuously. Although in the 
embodiment depicted in FIG. 2 the dataflow lines all have the same width, dataflow lines 
of varying widths that depend on the number of hits or items in a local results set may be 
used in combination with or instead of graphical preview portions 230a-f. 
[0046] In the embodiment shown in FIG. 2, the walls of graphical preview portions 
230a-f are straight, and there is an abrupt change in the slope of the side walls or at least a 
change that is not graphically smooth at the transition between graphical preview portions 
230a-f. Additionally, there are lines demarking the border between graphical preview 
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regions. These features facilitate distinguishing the transition between graphical preview 
regions 230a-f. However, in alternative embodiments the side walls and/or boundary 
demarcation lines may be curved (concave and/or convex), the slopes of the side walls at 
the transitions between graphical preview portions 230a-f may be graphically smooth 
(e.g., at the transitions, the slopes may appear to be mathematically continuous and the 
walls may appear mathematically smooth), and/or there may not be any lines demarking 
the borders between adjacent graphical preview portions 230a-f. 

ORDER DEPENDENT FILTER OBJECTS 

[0047] Having a direction of flow indicated by dataflow lines 220a-g facilitates 
having order dependent filter objects, such as the top 5% of the documents previously 
found. In FIG. 2 the filter objects associated with filter icons 206b-d are order dependent 
as indicated by descriptive titles 208b-d, having titles "Top 10 Customers," "Top 10 
prods by sales," "Top 10 prods by # orders" respectively. For example, in an extreme 
case, the top 10 customers may not purchase any of the top ten products by sales. Thus, 
if the top 10 products by sales filter, the filter associated with filter icon 206b is applied 
first, and the top ten customers filter, the filter associated with filter icon 206c, is applied 
second the result will be 0 products. In contrast, the top ten products by sales of the 
products bought by the top ten customers may be 90% of the top ten customers' 
purchases, and the top ten customers' purchases may be 20% of all products sold. 
Consequently, reversing the order of the filters associated with filter icons 206b and 
206c, and applying the top ten customers filter first (the filter associated with filter icon 
206c) and then applying the top ten products by sales filter (the filter associated with 
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filter icon 206b) may result in a item including 18% of the products sold. The effect of 
the ordering of the filters associated with filter icons 206b-d may be readily previewed by 
moving filter icons 206b-d on canvas 201 to have a different order in the dataflow. 
[0048] Order dependence is often the result of the filter operation being performed on 
the previous result. In other words, the input to the filter is the results set obtained from 
the previously applied filter. In an embodiment, filter objects are included that have their 
input from the entire database prior to any filtering, and which are therefore order 
independent. For example, a filter may be included that for the top 10% of all sales, 
rather than the top 10% of the sales of the data set from the search results of the output of 
the previously applied filter. 

GROUPS 

[0049] Group button 246 is used to represent a group of filter objects as one filter 
object. Any portion of the search query having one incoming dataflow line and one out 
going dataflow line can be grouped together into a single filter represented by a single 
icon. Filter icon 206a represents such a group filter object. In alternative embodiments, 
each icon representing a group filter object may have multiple input and/or output 
dataflow lines. 

[0050] In an embodiment, the user can select multiple icons and then click on group 
button 246 to create an icon representing the group of filters corresponding to the icons 
selected. In an embodiment, selecting group button 246 may create an empty group icon 
within which icons representing filter objects may be placed (by dragging and dropping). 
Clicking on filter icon 206a opens the group icon, allowing the user to view and/or edit 
the group represented by filter icon 206a. 
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[0051] After a portion of the search query (which may involve many filters and 
operators) has been established as a group, the icon that represents the group may be 
manipulated (e.g. dragged and dropped to other locations within the query) to move the 
entire portion of the query as a single unit. 

[0052] In an embodiment, the user can select a set of discontiguous filter icons 
(representing objects) before clicking on group button 246 to create a group. In this case, 
two or more objects having their filter icons vertically located at the same level on the 
dataflow will be ORed within the group. Similarly, two or more selected filter icons that 
are horizontally co-located, though vertically separated, on the dataflow will have their 
respective objects ANDed within the group. In an embodiment, when situations arise 
where two or more objects have their filter icons located so that it is ambiguous whether 
the user are intended them to be ANDed or ORed within the group, a preference will be 
given to ANDing the objects. In an alternative embodiment, a preference will be given to 
ORing the objects. Boolean representations within the group object that were not 
intended by the user can be easily remade, using existing UI rules for ANDing and ORing 
objects. 

NUMERICAL PREVIEW INDICATIONS AND MORE ON GROUPS 

[0053] FIG. 3 shows another graphical query window 300 having a canvas 301, icons 

302, 304, and 306, a hand button 307, a group object window 308, a representation of a 
group of objects 309, and numerical preview indicators 310, 312, 314, 316, and 318. 
[0054] Group object window 308 represents an open state of a group object. Double- 
clicking on a group icon causes the display of the group icon to be replaced with group 
object window 308. Group object window 308 includes the representation of the portion 
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of the query that has been established as part of the group, which may include group of 
objects 309 including filter objects, operator icons. 

[0055] The group of objects 309 contained within group object window 308 may 
even include one of more other group objects. Similar to parentheses in the text-based 
representation of Boolean expressions, group objects allow the user to organize the filter 
objects of a query in a manner according to the user's desires, so that it is easier for the 
user to gain an overview of a complex query. However, in contrast to parentheses, the 
groupings formed by group objects are easy to visualize, and do not have the confusions 
associated with parentheses, especially nested parentheses. 

[0056] In an embodiment, group objects may affect the order in which operations are 
performed in a similar fashion as parentheses affect the order in which operations are 
performed within a logical expression. Specifically, the objects within the group are 
evaluated and then the group object is applied after filter objects earlier in the query flow 
have been applied. By providing group objects that affect the order of evaluation of filter 
objects, the confusions associated with parentheses can be avoided while retaining 
ordering capabilities similar to parentheses. However, in alternative embodiments, the 
group object does not affect the order of evaluation of the filter objects contained within. 
[0057] Numerical preview indicators 3 1 0, 3 1 2, 3 1 4, 3 1 6, and 318 indicate the 
number of items expected to be found at each point during the query after applying the 
preceding filter objects. For example, numerical preview indicator 310 shows that there 
are 42,636 items that have been found before applying the filter objects associated with 
icons 302, 304, 306 and group object window 308. Numerical preview indicator 312 
shows that after applying the filter of object 302, there are only 35,509 items or hits 
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expected to be found in the pool of data, while numerical preview indicator 318 shows 
that after applying the series of filter objects associated with icons 302, 304 and 306 and 
ORing them with the group of filter objects associated with group object window 308, 
there are only 319 items that are expected to be found by the search query. 
[0058] Numerical preview indicators 3 10, 3 12, 3 14, 3 16, and 3 1 8 may be displayed 
prior to performing a search giving an estimate of the number of items or hits that will be 
found. Numerical preview indicators 310, 312, 314, 316, and 318 clarify the effects of 
AND and OR operators, thereby reducing the user's need to have a prior knowledge of 
Boolean logic. 

[0059] Numerical preview results can be obtained by at least three alternative 
methods. (1) Interim database queries can quickly return a row count for small, and/or 
non-complex databases. (2) Algorithms have been developed to estimate the size of 
returned query results from a large and/or complex database, without executing a time- 
consuming query. These rely upon statistics obtained from metadata housed within the 
database. (3) Algorithms have been developed to provide very rough indications of 
returned results, by using only the combination of AND and OR operators combined with 
the initial size of the database, without executing actual queries. 
[0060] FIG. 4 shows graphical query window 300 with the representation of the 
group of objects 309 expanded within group object window 308. Each of the objects 
within group 309 is represented by a separate icon, including filter icons 402, 404, and 
406, numerical preview indicators 408, 410 and 412, dataflow lines 414 and 416, and OR 
indicators 418 and 420. Similar to the larger query representation on the canvas, the 
various objects in the group of objects 309 are connected by dataflow lines. In an 
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embodiment, each filter object can be selected and edited by selecting its corresponding 
icon 402, 404, and 406 just as if they were not included within group object window 308. 
For example, numerical preview indicators 408, 410 and 412, dataflow lines 414 and 416, 
and OR indicators 418 and 420 are visible and behave no differently than if they were 
located directly on canvas 301. 

[0061] In the embodiment depicted in FIG. 4, hand button 307 may be used to cause 
hand 407 to appear. Hand 407 is used to move the image within group object window 
308 to see different parts of the group of objects 309, should the group of objects 309 be 
too large to fit within the window of group object window 308. In an alternative 
embodiment, the window of group object 308 includes horizontal and/or vertical scroll 
bars, and can be scrolled in order to see different parts of the group. An embodiment 
includes a user interface mechanism that allows a user to scroll through group object 
window 308 using directional keys, a mouse, track ball, touch sensitive pad, or other such 
navigational or cursor control device. Other embodiments may include any combination 
of hand 307, scroll bars, or any other navigational software or hardware may be available 
for the user. 

[0062] In an embodiment, objects within a group container can be viewed at one of at 
least two zoom levels, as controlled by a magnifying glass icon on the opened group UI 
(group object window 308 of group of objects 309), for example. At one or more of the 
zoomed-in levels, objects (as represented by filter icons such as filter icons 402, 404, and 
406) within group object window 308 appear at the same size and resolution as filter 
objects 302, 304, and 306 appear on the dataflow and/or canvas 301 of query editor 
window 300. Only a subset of filter icons representing objects contained within group 
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object window 308 will normally appear when viewed at a zoomed-in level as depicted in 
FIG. 4, for large sets of grouped objects. At the zoomed-out level, all filter icons 
representing objects of group of objects 309 within the group container appear within 
group object window 308 as depicted in FIG. 3, but at a relatively coarse level of detail 
that does not display object labels. In an embodiment, in the zoomed-out state of FIG. 3, 
however, lines still indicate Boolean connections between objects, and objects can still be 
selected, dragged, and dropped in a similar way as objects in the' parent query editor 
window. In this manner, users can explore the contents of a large group at a high-level 
view, then zoom-in to work with specific contained objects. 
[0063] In an embodiment, group containers can contain other groups, as well as 
condition filters. The hierarchical containment of groups can provide a powerful and 
flexible graphical analogy for nested sub-queries, much like nested parentheses within 
SQL statements. 

QUERY STEPS 

[0064] FIG. 5 shows query "steps" 502, 504, 506, and 508. In this context, a "step" 
corresponds to a stage of the query. In the example illustrated in FIG. 5, the query is 
divided into four stages, each of which is associated with a step. Within query step 502 is 
group icon 510, and within query step 504 is icon 512. Within query step 506 are icons 
514 and 516, and OR indicator 518. Within query step 508 are icons 512, 522, and 524, 
NOT indicator 526, and OR indicator 528. Each of query steps 502, 504, 506, and 508 is 
associated with one of tabs 530, 532, 534, or 536. Tab 530 includes window 538 and 
enable button 540. Although not labeled or discussed further, each of tabs 532, 534, and 
536 also have a window and enable button that function similar to window 538 and 
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enable button 540. In alternative embodiments, condition builder 500 may not have all of 
the features listed above or otherwise associated with FIG. 5, and/or may have other 
features in addition to or instead of the features listed above or otherwise associated with 
FIG. 5. For example, step containers could be represented as shaded areas underlying 
specific portions of a query. 

[0065] Query steps 502, 504, 506, and 508 act as containers for some portion of the 
query. Similar to a group, in an embodiment, there is one input dataflow line and one 
output dataflow line for each one of query steps 502, 504, 506, and 508. However, in 
alternative embodiments there may be multiple input and/or output dataflow lines for any 
or all steps. 

[0066] Each query steps 502, 504, 506, and 508 has various attributes and/or 
properties associated with it. Some of the attributes associated with a step are whether 
the step is enabled or disabled, the order in which the step appears in the search, and the 
manner in which the search results of the step are combined with the rest of the search 
query. Many of the attributes and/or properties of query steps 502, 504, 506, and 508 are 
indicated on their respective tabs 530, 532, 534, and 536, which serve as a user interface 
mechanism for viewing and manipulating these attributes and/or properties. 
[0067] Regarding the attribute or property of the order in the search, each of query 
steps 502, 504, 506, and 508 are numbered 1, 2, 3, and 4, as indicated on tabs 530, 532, 
534, and 536, respectively, to indicate the order in which their associated conditions are 
applied relative to one another. The order of query steps 502, 504, 506, and 508 may be 
changed by dragging one of query steps 502, 504, 506, and 508 to a different position 
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within the sequence of query steps automatically causing a change in the values of 
numbers 1, 2, 3, and 4 to reflect the new order. 

[0068] In alternative embodiments, in addition to or instead of dragging and dropping 
query steps 502, 504, 506, and 508, the order of query steps 502, 504, 506, and 508 may 
be changed by changing the values of their associated numbers. In an embodiment, 
changing the numbers 1, 2, 3, and 4, may automatically change the position of the their 
associated query steps 502, 504, 506, and 508. Alternatively, changing the values of 
numbers 1, 2, 3, and 4 may only change the order in which the steps are applied but not 
their physical location on the screen. There is no limit to the number of objects that can 
be placed, and the complexity of the portion of the search query within each of query 
steps 502, 504, 506, and 508. In an embodiment, steps may be mixed with Boolean logic 
not included in any step. For example, steps may be added to the end or anywhere else in 
the search query of FIG. 2. In an embodiment, a new step may be automatically added to 
the end of a search query such as that of FIG.2 or FIG. 5. 

[0069] Regarding the attribute or property of how the search results of the step are 
combined with the rest of the search, a window may be provided for selecting a method 
of combination. Using tab 530 as an example, window 538 indicates an attribute or 
property of how query step 502 is applied. In an embodiment, window 538 includes a 
pull down menu with choices, such as "keep," "add," and "remove," which determine 
whether the local results set of query step 502 (found as a result of applying only the 
filter associated with icon 510 regardless of any prior steps in the query flow) are the only 
items or hits kept, added to the global results set, or removed from the global results set, 
respectively. For example, if window 538 is used to select "keep," then the search results 
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from step 502 are kept and the only records in the data set are those of the customer 
group of group icon 510, which is the same data set as if the filter associated with icon 
510 were applied without being included in a query step. If the window 538 is used to 
select "remove," then the result of applying the filter associated with icon 510 is removed 
from the input data set, which in the case of step 502 the input data set is the entire 
original dataset. Thus, the output of step 502 is the entire database with customer group 
associated with group icon 510 removed. If the window 538 is used to select "add" then 
the customer group associated with group icon 510 is added to the input of query step 
502. However, since the customer group associated with group icon 510 was already part 
of the original dataset, the output of step 502 is the entire original dataset. However, 
were the filter associated with icon 512 the top 10 customers of the entire original dataset, 
and were step 502 set to "keep" and step 504 set to "add," then the output of step 504 
would be the union of the customer group associated with group icon 510 and the top 10 
customers of the entire data set. This allows a user ignorant of Boolean logic to perform 
a search without learning Boolean logic. In an alternative embodiment, the steps are 
combined with the rest of the query using or with other steps using the Boolean logic 
within the step. 

[0070] In the illustrated embodiment, each step also includes an "enable" attribute or 
property that a user can turn ON or OFF by interacting with an enable icon 540. When 
the enable attribute of a step is set to ON, the step (with its enclosed objects) is enabled. 
Conversely, when the enable attribute is OFF, the step is disabled. 
[0071] A query having a disabled query step is equivalent to the same query with all 
of the Boolean logic associated with the step having been removed. For example, were 
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step 502 disabled by selecting and thereby toggling the enable icon 540, then the output 
of step 502 would be the same as its input, which is the entire original dataset. 
Consequently, the input to step 504 would also be the entire original dataset. The 
disabled query step (e.g., query step 502), nonetheless, still physically appears in the 
query flow, so that the query step can be enabled without recreating it. The capability of 
being able to enable and disable individual query steps especially in combination with the 
numerical preview indicators (shown in FIGs. 3 and 4) allows the user greater flexibility 
in comparing the affects of each query step on the results of the search query before 
performing the search. In the embodiment depicted in FIG. 5, both query steps and group 
objects are available to the user. However, in alternative embodiments, query steps are 
provided as a replacement to group objects. Although not illustrated, graphical preview 
sections may also be used in conjunction with the embodiment of FIG. 5. In an 
embodiment, steps may contain groups and condition filters, but'may not contain other 
steps. In an alternative embodiment, steps may be nested within other steps. In an 
alternative embodiment, steps may also contain parts of other steps. 

USER INTERFACES 

[0072] FIG. 6 shows a user interface 600 having a query flow editor 602 for 
arranging a search query, an item container 602a having a list of information against 
which the query is to be executed, an item navigator 603 for selecting items and 
conditions having conditions folder 604 including conditions 606-618, an item tab 620 
having a list of items that can be selected, a conditions tab 624 for editing the conditions 
specified in each filter, and an "other" tab 622. In alternative embodiments, user 
interface 600 may not have all of the features listed above or otherwise associated with 
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FIG. 6, and/or may have other features in addition to or instead of the features listed 
above or otherwise associated with FIG. 6. 

[0073] Query flow window 602 is used for graphically constructing a search query. 
Query flow window 602 may include any mixture of features of any combination of 
query flow editors 200, 300, and 500 of FIGs. 2-5 in which the search is visually depicted 
and can be constructed. In the example of FIG. 6, conditions tab 624 is selected, and 
therefore conditions folder 604 is displayed. Each of conditions 606-618 corresponds to 
a different filter condition. In an embodiment, graphical condition builder 600 supports 
clicking on an icon associated with one of conditions 606-618, and dragging and 
dropping the selected one of conditions 606-618 onto canvas 602 to construct a search 
query. Graphical condition builder 600 supports editing the conditions associated with an 
icon representing a filter object already on canvas 602 using the corresponding condition 
in condition folder 604. Item tab 620 may be used for choosing the set of selected items 
within item container 602a and/or conditions folder 604. Other tabs, such as other tab 
622 may be used for opening other folders and/or for performing other database 
operations. Although not illustrated, graphical preview sections may also be used in 
conjunction with the embodiment of FIG. 6. 

[0074] Figure 7 shows an alternative user interface 700 having a query flow pane 
702, query objects pane 704, object attributes pane 706, and status and control pane 707. 
Query flow pane 702 includes new condition button 703a and calculation button 703b. 
Query objects pane 704 includes tree 705 and a list of query objects 708, 710, 712, 714, 
716 and 719, having names 720, 722, 724, 726, 728, 730, and 732, and filter conditions 
734, 736, 738, 740, 742, 744 and 746, respectively. The object attributes pane 706 
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includes a name box 750, an item navigator 752, an operator box 758, a values box 760, a 
delete button 762, a preview button 764, and an advance link 765. The status and control 
pane 707 includes a status line 766, a control bar 768 including undo button 770, redo 
button 772, apply button 774, and OK button 776. In alternative embodiments, user 
interface 700 may not have all of the features listed above or otherwise associated with 
FIG. 7, and/or may have other features in addition to or instead of the features listed 
above or otherwise associated with FIG. 7. 

[0075] Query flow pane 702 is used for graphically constructing a search query. 
Similar to query flow editor 602, query flow pane 702 may include any combination of 
features of any combination of query flow editors 200, 300, and 500 of FIGs. 2-5 in 
which the search is visually depicted and can be constructed, forexample. New 
condition button 703a may be used to add a new query object to the query of query flow 
702. In an embodiment, the new query object is automatically added to the bottom of the 
query flow depicted in query flow pane 702. Calculation button 703b is equivalent to 
calculation tab 622 and has the same function. 

[0076] In an embodiment, each of the filter objects corresponding to the icons of 
query flow window 702 is listed in query objects pane 704. The query objects pane 704 
may also contain query objects that are not included in the query depicted in query flow 
pane 702. New condition button 703a may also add the new query object (that was added 
to query flow pane 702) to query objects pane 704. Alternatively, new condition button 
703a may only add the new query object to query flow pane 702, or in another 
embodiment may only add the new query object to query objects pane 704. In the 
example of FIG. 7, query objects pane 704, query objects 708, 710, 712, and 714 are 
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located on tree 705. However, query objects 716, 718, and 719 may be arranged in a tree 
structure. The tree structure may also reflect group objects. In other words, in an 
embodiment, a group object may be a branch on a tree, for example, and the filter objects 
contained within the group object will be placed as branches branching from the group 
object branch. In alternative embodiments, query objects pane 704 may list filter objects 
without showing any tree structure. Filter objects contained within a group object may 
only be visible if the group object is clicked on or in alternative embodiments may be 
visible next to the group object within which they are contained. 

[0077] Object attribute pane 706 displays the attributes of the object selected in query 
objects pane 704. In the example of FIG. 7, query object C10 (719) is selected, and 
accordingly object attributes pane 706 displays the attribute of query object C10 (719). 
The attributes of the query object displayed in object attribute pane 706 can be edited 
using object attribute pane 706. For example, the name 732 can be changed using name 
box 750, the item being filtered can be changed using item navigator 752, the operator 
being used on the item can be changed using operator box 758, and the value can be 
changed using values box 760. In an embodiment, the name, item, and/or condition 
associated with a query object may be edited by clicking on it wherever it appears in user 
interface 700. 

[0078] In an embodiment, objects may be deleted by (1) dragging them off their 
respective windows, or (2) through a right-click, context menu. Alternatively, a trash can 
or other icon may be provided as a delete object, which may be included in addition to or 
as an alternative to either or both of the above two methods of deleting a filter icon and 
its corresponding object. Additionally or alternatively delete button 762 may be used to 
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delete an object from query objects pane 704 and/or query flow pane 702. Preview 
button 764 may be used to preview the expected affect resulting from the change in 
attributes performed using attributes pane 706. When preview button 764 is selected 
query flow pane 702 and query object pane 704 are updated. By providing preview 
button 764, the user is given control of when the preview is updated, thereby avoiding 
unwanted updates that might otherwise be performed before the user has changed all 
items desired, for example. A selection outside object attributes pane 706 may also 
trigger an update of query flow pane 702 and query object pane 704. Advanced link 765 
allows the user to input complex Boolean expressions in the filter object of object 
attributes pane 702, thereby at least partially bypassing the use of graphical input in the 
query flow pane 702. 

[0079] Control and status pane 707 may include various control functions and status 
indications. In the embodiment of FIG. 7, control and status pane 707, via status line 
766, gives the status of the graphical condition builder 700. Some values of status line 
766 may display may be "idle" or "editing query flow," for example. Control and status 
pane 707 may include control bar 768, which in the embodiment illustrated in FIG. 7 
control bar 768 includes a variety of control buttons. However in alternative 
embodiments, control bar 768 may include a variety of different inputs means for 
entering or implementing control functions, such as control buttons, text windows, pull 
down menus, and/or dialog boxes. Undo button 770 of control bar 768 may be used to 
undo the last action such as the addition of a filter object to a query, while redo button 
772 may be used to redo an undone action. Clicking on apply button 774 or OK button 
776 causes a search to be performed. 
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[0080] Graphical condition builders 600 and 700 may also include an SQL dialog box 
that may be invoked from a window level menu item, for example. 

ALTERNATIVE EMBODIMEMT 

[0081] Figure 8 shows another embodiment of a graphical condition builder 800 in 
which the objects, instead of being represented by icons, are represented by tiled boxes 
802 and 810, which are referred to herein as "baskets". The first tiled box 802 is a group 
item, which can be ungrouped using tab 804. The tiled boxes are linked via dataflow 
lines 812 and 814. The contents of the group are shown in window 806 and the name of 
the group is indicated in text indicator 808. The second titled box 810 includes a 
combined condition window 816, a first condition box 818 and a second condition box 
820. Condition windows 818 and 820 include item windows 822 and 824, operator 
windows 826 and 828, value windows 830 and 832, and condition windows 834 and 836, 
respectively. Tiled box 810 and condition boxes 818 and 820 include 
minimize/maximize/close button sets 838, 840 and 842, respectively. Graphical 
condition builder 800 also includes numerical indicators 844, 846, 848, and 850. 
Graphical Condition builder 800 also may include start button 852, submit button 854, 
reset button 856, new condition button 858, and new basket button 860. In alternative 
embodiments, graphical condition builder 800 may not have all of the features listed 
above or otherwise associated with FIG. 8, and/or may have other features in addition to 
or instead of the features listed above or otherwise associated with FIG. 8. 
[0082] Tiled box 810 is labeled Basket 1, and has flow lines 812 and 814 that indicate 
a parallel flow, which indicate that an OR operation is being performed between filter 
objects 818 and 820. Similar to the previous embodiments, the widths of dataflow lines 
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812 and 814 may be varied to reflect the relative sizes of the corresponding collection of 
documents resulting from the application of the filter objects earlier in the query. 
[0083] Combined condition window 816 indicates that an OR operation is being 
performed between condition 3 and condition 4, which are the conditions associated with 
the filter objects associated with condition boxes 818 and 820. Conditions 3 and 4 are 
constructed by selecting an item on which the filter is being placed, which are listed in 
item windows 822 and 824, respectively. The operators being applied are chosen using 
operator windows 826 and 828, and the values associated with conditions 3 and 4 are 
selected using value windows 830 and 832, respectively. Although pull-down windows 
are illustrated as the input means for item windows 822 and 824, operator windows 826 
and 828 and value windows 830 and 832, other input means may be used such as text 
entry windows. 

[0084] Numerical preview indicators 844, 846, 848, and 850 indicate the number of 
items estimated to be found after the filter object they are associated with and any 
previous filter objects have been applied. Thus, numerical preview indicator 844 
indicates the total number of items in the database prior to applying any filters. 
Numerical preview indicator 848 indicates the number of items expected to be found 
after the application of the filter objects associated with tiled box 802 and condition box 
818. Numerical preview indicator 850 indicates the number of items expected to be 
found after the application of the filter objects associated with tiled box 802 and 
condition box 820. Numerical preview indicator 846 is the number of items expected to 
be found by taking the union of the items represented by numerical preview indicators 
848 and 850. 
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[0085] The use of a basket to depict an OR operation emphasizes that the basket 
contains the union of the items of the two condition filters being ORed. The appearance 
of a filter object or a basket may differ depending upon whether the basket or filter object 
is connected to the query or under construction. For example the color of the basket or 
filter object may change once connected to the query. 

[0086] Start button 852 is used to initiate the formation of a query or may be used to 
update the numerical preview indicators. Submit button 854 is used to submit a query 
and initiate a search after the formation of a query. Reset button 856 is used to clear the 
search query from the screen. New condition button 858 and new basket button 860 may 
be used for adding a new condition and new basket, respectively, and may initiate a series 
of dialog boxes or other actions and/or prompts facilitating the formation of a new 
condition and basket, respectively. Although not illustrated, graphical preview sections 
may also be used in conjunction with the embodiment of FIG. 8. 

MULTIDIMENSIONAL DATABASES 

[0087] As mentioned above, graphical condition builders 200, 300, 500, and 800 (of 
FIGs. 2, 3, 5, and 8, respectively) and user interfaces 600 and 700 (of FIGs. 6 and 7, 
respectively) may be used in any context that requires the construction or display of a 
query composed of elements joined by Boolean logic, using the same graphical 
representation of the search query and the same GUIs. Such contexts may include, for 
example, queries to be run against relational databases, queries to be run against 
multidimensional databases, and any other forms of On-Line Transaction Processing 
(OLTP) or On-Line Analytical Processing (OLAP). Each dimension may be added into 



50277-2252 
(OID No. 2002-175-01) 



34 



ORACLE CONFIDENTIAL 

the search by selecting or double clicking a representation of that dimension, for 
example. 

[0088] According to one embodiment, when the graphical condition builders, 
discussed above, is used with a multidimensional database, an estimate for the number of 
data items in the search results may be obtained by calculating the number of cells of 
each dimension that the search is being limited to. In other words, rather than providing a 
preview of the number of results found after each object filter, the number of cells found 
in each dimension is provided as an indication of the number items expected to be found 
by the search query, for example, even though many of these cells may not be populated. 
[0089] For example, a search in a database of sales might be for sales having a date 
between Jan. 1, 2000 and Dec. 31, 2000 and at a location equal to "San Jose OR Boston" 
and by the top 10 customers by number of sales, where the search terms are entered in the 
order they are described in this sentence. Assume that the database extends from the end 
of the year 1992 to the end of the year 2002, has 30 cities as locations, and sells 15 
different products. Assume also that the multidimensional database uses location, date, 
and product ID as dimensions, with cells for entries for each day of the year. The total 
number of cells in the database is 10(365) x 30 x 15= 1624500, where the multiplicative 
factor of 10(365) is from the ten year span of the database, the 30 is from the 30 cities 
and the 15 is from the 15 products. Then the filter for the date limits the number of cells 
to 1/1 0 th the total number of cells (one year of the total 10 years). Consequently, after the 
date filter the number 162450 would be displayed as the estimate for the size of the 
dataset. The city filter further limits the search to 2 cities out of 30 or to 1/1 5 th the total 
number of cities. Consequently, the numerical preview displayed would be 1/1 5 th the 
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number of cells resulting from the output of the date filter or 162450/15=10950. In 
another embodiment, these estimates are further improved by multiplying them by one or 
more further multiplicative factors. In an embodiment, a further multiplicative factor is 
the average density of the cells that are populated, where the average is based on all the 
cells in the multidimensional database. Alternatively, a distribution function giving the 
density of cells populating different regions of the database may be used to obtain the 
multiplicative factor. 

[0090] In an embodiment there is also an estimate used to calculate the output of 
filters that do not pertain to any one dimension per se of the database, such as the top 10 
customers filter of the above example. In another embodiment no numerical estimate is 
given after filters that do not pertain to any one dimension. In yet another embodiment, 
the numerical estimate given after a filter not pertaining to any one dimension is the same 
as the numerical estimate given before that filter, because the total number of cells that 
the search query spans is unchanged. 

HARDWARE OVERVIEW 

[0091] FIG. 9 is a block diagram that illustrates a computer system 900 upon which 
an embodiment of the invention may be implemented. Computer system 900 includes a 
bus 902 or other communication mechanism for communicating information, and a 
processor 904 coupled with bus 902 for processing information. Computer system 900 
also includes a main memory 906, such as a random access memory (RAM) or other 
dynamic storage device, coupled to bus 902 for storing information and instructions to be 
executed by processor 904. Main memory 906 also may be used for storing temporary 
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variables or other intermediate information during execution of instructions to be 
executed by processor 904. Computer system 900 further includes a read only memory 
(ROM) 908 or other static storage device coupled to bus 902 for storing static 
information and instructions for processor 904. A storage device 910, such as a magnetic 
disk or optical disk, is provided and coupled to bus 902 for storing information and 
instructions. 

[0092] Computer system 900 may be coupled via bus 902 to a display 912, such as a 
cathode ray tube (CRT), for displaying information to a computer user. An input device 
914, including alphanumeric and other keys, is coupled to bus 902 for communicating 
information and command selections to processor 904. Another type of user input device 
is cursor control 916, such as a mouse, a trackball, or cursor direction keys for 
communicating direction information and command selections to processor 904 and for 
controlling cursor movement on display 912. This input device typically has two degrees 
of freedom in two axes, a first axis (e.g., x) and a second axis (e.g., y), that allows the 
device to specify positions in a plane. 

[0093] The invention is related to the use of computer system 900 for searching a 
database using a graphical condition builder to formulate the query. According to one 
embodiment of the invention, user input such as selecting an item, double clicking, and 
dragging and dropping icons are provided by input device 914 or cursor control 916 and 
response thereto are provided by computer system 900 in response to processor 904 
executing one or more sequences of one or more instructions contained in main memory 
906. Such instructions may be read into main memory 906 from another computer- 
readable medium, such as storage device 910. Execution of the sequences of instructions 
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contained in main memory 906 causes processor 904 to perform the process query steps 
described herein. One or more processors in a multi-processing arrangement may also be 
employed to execute the sequences of instructions contained in main memory 906. In 
alternative embodiments, hard- wired circuitry may be used in place of or in combination 
with software instructions to implement the invention. Thus, embodiments of the 
invention are not limited to any specific combination of hardware circuitry and software. 
[0094] The term "computer-readable medium" as used herein refers to any medium 
that participates in providing instructions to processor 904 for execution. Such a medium 
may take many forms, including but not limited to, non- volatile media, volatile media, 
and transmission media. Non-volatile media includes, for example, optical or magnetic 
disks, such as storage device 910. Volatile media includes dynamic memory, such as 
main memory 906. Transmission media includes coaxial cables, copper wire and fiber 
optics, including the wires that comprise bus 902. Transmission media can also take the 
form of acoustic or light waves, such as those generated during radio wave and infrared 
data communications. 

[0095] Common forms of computer-readable media include, for example, a floppy 
disk, a flexible disk, hard disk, magnetic tape, or any other magnetic medium, a CD- 
ROM, any other optical medium, punch cards, paper tape, any other physical medium 
with patterns of holes, a RAM, a PROM, and EPROM, a FLASH-EPROM, any other 
memory chip or cartridge, a carrier wave as described hereinafter, or any other medium 
from which a computer can read. 

[0096] Various forms of computer readable media may be involved in storing one or 
more instruction for implementing the graphical condition builder of this application 
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and/or for carrying one or more sequences of one or more instructions to processor 904 
for execution. For example, the instructions may initially be carried on a magnetic disk 
of a remote computer. The remote computer can load the instructions into its dynamic 
memory and send the instructions over a telephone line using a modem. A modem local 
to computer system 900 can receive the data on the telephone line and use an infrared 
transmitter to convert the data to an infrared signal. An infrared detector coupled to bus 
902 can receive the data carried in the infrared signal and place the data on bus 902. Bus 
902 carries the data to main memory 906, from which processor 904 retrieves and 
executes the instructions. The instructions received by main memory 906 may optionally 
be stored on storage device 910 either before or after execution by processor 904. 
[0097] Instructions for running the graphical condition builders of this application 
may be stored on any one of or any combination of the computer readable mediums that 
are accessed by computer system 900. 

[0098] Computer system 900 also includes a communication interface 918 coupled to 
bus 902. Communication interface 918 provides a two-way data communication 
coupling to a network link 920 that is connected to a local network 922. For example, 
communication interface 918 may be an integrated services digital network (ISDN) card 
or a modem to provide a data communication connection to a corresponding type of 
telephone line. As another example, communication interface 918 may be a local area 
network (LAN) card to provide a data communication connection to a compatible LAN. 
Wireless links may also be implemented. In any such implementation, communication 
interface 918 sends and receives electrical, electromagnetic or optical signals that carry 
digital data streams representing various types of information. 
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[0099] Network link 920 typically provides data communication through one or more 
networks to other data devices. For example, network link 920 may provide a connection 
through local network 922 to a host computer 924 or to data equipment operated by an 
Internet Service Provider (ISP) 926. ISP 926 in turn provides data communication 
services through the worldwide packet data communication network now commonly 
referred to as the "Internet" 928. Local network 922 and Internet 928 both use electrical, 
electromagnetic or optical signals that carry digital data streams. The signals through the 
various networks and the signals on network link 920 and through communication 
interface 918, which carry the digital data to and from computer system 900, are 
exemplary forms of carrier waves transporting the information. 

[00100] Computer system 900 can send messages and receive data, including program 
code, through the network(s), network link 920 and communication interface 918. In the 
Internet example, a server 930 might transmit a requested code for an application 
program through Internet 928, ISP 926, local network 922 and communication interface 
918. In accordance with the invention, one such downloaded application provides for a 
graphical condition builder and user interface as described herein. 
[00101] The received code may be executed by processor 904 as it is received, and/or 
stored in storage device 910, or other non- volatile storage for later execution. In this 
manner, computer system 900 may obtain application code in the form of a carrier wave. 
[00102] In the foregoing specification, the invention has been described with reference 
to specific embodiments thereof. It will, however, be evident that various modifications 
and changes may be made thereto without departing from the broader spirit and scope of 
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the invention. The specification and drawings are, accordingly, to be regarded in an 
illustrative rather than a restrictive sense. 
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